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(54) Adapter card slot isolation for hot plugging 

(57) A computer system is provided with at least one 
connector slot (4, 4a) for receiving a feature card (5), 
that implements specific functions such as I/O, memory 
or the like. When alteration of the hardware configura- 
tion is desired a user causes a reset control signal to be 
issued from an I/O bridge chip (104). This reset control 
signal is used to initiate the functions of ceasing data 
processing activity for the card (5) to be removed de- 
coupling the slot (106) from the bus (102) and causing 
the electrical power to be gradually decreased. The re- 
set control signal then remains active until the original 



card(s) is removed and the new card is installed in the 
slot. Once the new card is mechanically installed in the 
connector (4, 4a) ( then power is brought up, the slot 
(106) is coupled to the bus (102) and the reset signal 
from the bridge chip (104) is deactivated. This allows 
the configuration software to begin data processing ac- 
tivity with the new card. In this manner, an individual slot 
or bank of slots can be isolated from other slots in the 
computer system, such that particular adapter cards can 
be changed without the need to power down the entire 
computer system. 
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Description 

Technical Field 

The present invention generally relates to removing 5 
and installing adapter, or feature cards in a computer 
system. More specifically, a system is disclosed which 
allows changing the adapter cards in the computer sys- 
tem without having to power down and/or remove the 
cover from the entire computer. 10 

Background Art 

Typical computer systems include a system. board 
which includes a microprocessor and other application . is 
specific integrated circuits (ASIC), such as memory con- , 
trollers, input/output (I/O) controllers, and the like, elec : 
trically connected to one another by wiring layers. Also, 
most computers include slots for additional adapter 
cards which can connect the chips on the cards to the 20 
microprocessor and/or other chips on the system board, 
to provide additional function to the computer system. 
Typical functions that a user might add to a computer 
include additional memory, fax/modem capability, sound 
cards, graphics cards or the like. The slots, included on 25 
the system board generally include in-line electrical con- 
nectors having electrically conductive lands which re- 
ceive exposed tabs on the adapter cards. The i/Os of 
the chips on the cards are connected to the tabs. The 
connector is then electrically connected to the micro- 30 
processor, or the like through the previously mentioned . 
wiring layers. 

In conventional computer systems, a user, must 
power off the system and first remove the cover from 
the entire computer system before the add itipnaj card .35 
(s) can be accessed. This is true whether an existing 
card is being removed and/or a new card is being added 
to the computer. Often, it is a time consuming operation 
to remove and replace the cover of the computer sys- 
tem. Several metal screws must be removed and then 40 
reinstalled, and the cover frequently requires very pre- 
cise alignment before it seats on the computer frame. 
Also, the actual installation of the card into the adapter 
slot can be a painstaking and time consuming operation,, 
since the user is required to precisely align the card and 45 
slot, without the aid of any type of alignment device, and 
exert sufficient (but not too much) pressure for electrical 
contact to be made, without damaging the. card or con- 
nector. ' ' .' ' - . 

Therefore, it can be seen that a need exists for a .so 
computer system which would allow a user to change 
the hardware configuration^ a computer by removing 
a feature card from and/or installing a feature card into 
a computer system without the heed of removing the ac- 
tual cover from the computer system, and powering ss 
down the entire system, or taking the computer off-line. 
Additionally, a system would be advantageous that 
would assist the user in aligning the card and connector 



to ensure proper electrical connection and avoid dam- 
age to either component. 

Disclosure of the Invention 

In contrast to the prior art, the present invention pro- 
vides a computer system which allows a user to remove 
or. install feature cards (i.e. change the hardware con- 
figuration of the computer) without powering down and/ 
or removing the cover of the entire computer system. 
The present invention allows individual connectors to be 
disabled such that specific feature cards can be re- 
moved or replaced, without the need for powering down 
the entire computer system. 

The invention provides a computer system, com- 
prising:^ CPU;.at least one I/O slot, electrically connect- 
ed to said CPU, for receiving a feature card; and means 
for changing a hardware configuration of said computer 
system by deactivating said at least one I/O slot while 
said CPU concurrently performs data processing oper- 
ations. 

The invention also provides a method of changing 
a hardware configuration in a computer system having 
a CPU, comprising the steps of: providing at least one 
I/O slot, electrically connected to said CPU, for receiving 
a feature card; and deactivating said at least one I/O slot 
while said CPU concurrently- performs data. processing 
operations. 

When alteration of the hardware configuration is de- 
sired a user causes a reset control signal to be issued 
from an I/O bridge chip. This reset control signal is used 
to initiate the functions of ceasing data processing ac- 
tivity for the card to be removed, decoupling the slot from 
. the bus and causing the electrical power to be gradually 
decreased. The reset control signal then remains active 
until the original card is removed and the new card is 
installed in the slot. Once the, new card is mechanically 
installed in the connector, then power is brought up, the 
slot is coupled to the bus and the reset signal from the 
bridge chip is deactivated. This allows the configuration 
software to begin data processing activity with the new 
card. In this manner, an individual slot, or bank of slots 
can be isolated from other slots in the computer system, 
such that particular adapter cards can be changed with- 
out the need to power down the entire computer system. 

Brief Description of the Drawings 

The invention will now be described, by way of ex- 
ample only, with reference to the accompanying draw- 
ings, in which: 

Figure 1 is a perspective view of a system board 
and an adapter card, and the mechanical relation- 
ship therebetween; 

Figure 2 is an elevation view of an adapter card with 
a corresponding attached guide member; 
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Figure 3 is a top view taken along section line A-A 
of Figure 2 of the adapter card and guide member 
of the present invention; 

Figure 4 is an elevation view of the adapter card and 
guide member of the present invention taken along 
section line B-B of Figure 2; 

Figure 5 is a perspective view of a computer system 
cover showing the slots which accommodate the 
adapter card and guide member of the present in- 
vention; 
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Figure 6 is schematic diagram of a computer system 
having a system bus connected to an I/O bus is 
through a host bridge, wherein a number of adapter 
slots are electrically connected to the 1/6 bus- 
Figure 7 is a schematic diagram showing the control 
signals required for operation of the adapter card 20 
slot aspect, including a bank of card slots, of the 
present invention; 

Figure B is a flow chart showing the various process 
steps implemented by the present invention to hot 2s 
plug the adapter cards into the I/O slots; 

Figure 9 is a schematic diagram illustrating the con- 
trol signals required for another aspect of the 
present invention wherein the I/O adapter slots are 30 
isolated to provide enhanced error recovery; 

Figure 10 is a flow chart of the steps needed to im- 
plement the error recovery aspects of the present 
invention; 35 

Figure 11 is a schematic diagram showing one ex- 
ample of a circuit that could be used by the present 
- invention to ramp up or down the power to an adapt- ' 
er card slot; 40 

Figure 12 is a block diagram illustrating one pre- 
ferred embodiment of the present invention wherein 
a bank of slots can be deactivated to install, remove 
or replace a card without requiring the entire system 45 
to be taken off line; and ■ • - 1 . 

Figure 13 is a flow chart showing the steps needed 1 
to remove, install or replace the cards in a bank of 
slots. so 

Detailed Description of the Invention 

Referring to Figure 1 , a perspective view of a sys- 
tem board 1 and feature, or adapter, card 5 is shown, ss 
Board 1 includes various integrated circuit chips such 
as a microprocessor 2, e.g. a PowerPC microprocessor 
available from the IBM Corporation (PowerPC is a trade- 



mark of IBM) and other application specific integrated 
circuits 3, such as a memory, I/O controller or the like. 
In line connectors 4 and 4a are also shown attached to 
system board 1 These connectors are electrically con- 
nected to the ICs on board 1 through wiring layers which 
are present in the system board. Electrically conductive 
lands 10 and 10a are present in connectors 4 and 4a 
which will interconnect with electrically conductive tabs 
on a feature card. The feature card 5, also known as 
device 5, is shown perspective^ and includes an inter- 
connection portion 8 having conductive tabs 9 therein. 
These tabs 9 will contact lands 10 in connector 4 such 
that electrical connection can then be made between the 
various components on system board 1 and the chips 
present on feature card 5. Chips 6 and 7 on feature card 
5 could be any one of a number of integrated circuits 
that will provide additional function to the computer sys- 
tem. For example, these chips 6 and 7 may be memory, 
graphics accelerator, math co-processor, modem, or the 
like ICs. Again, there are wiring layers present in feature 
card 5 which will connect chips 6 and 7 on feature card 
5 with microprocessor 2 and chip 3 on the system board * 
when card 5 is inserted into connector 4. Those skilled 
in the art will understand that card 5 and system board 
1 can be any one of a number of substrates, which in- 
clude layers of electrically conductive, and alternating 
insulating material, connected to one another through 
vias. The layers in board 1 and card 5 are brought out 
to surface pads and then connected.to the I/O points on 
the various chips by using one of the many interconnec- 
tion methods, such as controlled collapse chip connect 
(C4), solder ball connect (SBC) wire bonding, surface 
mount technology (SMT) or the like. 

Figure 2 shows a preferred embodiment of the guid- 
ing means of the present invention. Adapter card 5 is 
shown having electrically conductive tabs 9 in the same 
manner as described with regard to Figure 1. Addition- 
ally, a card guide 31 is shown which is affixed to card 5 
by friction fit, clamping, screws, or other attachment 
means: It should be noted that guide 31 can be short- 
ened, or otherwise modified to accommodate one-half 
sized adapter cards, which are common in the industry. 
The invention will be described using a full sized adapter 
card, but rfshould be understood that a half-size card is 
contemplated by the' scope of the present invention. 
Guide 31 includes end portions 29 and 27 which are af- 
fixed to the ends of adapter card 5 by attachment means - 
30. ' * : 

A guide rail 28 is provided which slidably receives 
the card guide 31, as shown in greater detail in Figure 
4. At least one shoulder portion 33 is included which fits, 
or conforms with the interior surface of guide rail 28 (see 
Figure 4). Pivot points 24 and 25 are rigidly affixed to 
guide rail 28. Pivot 25 is also rotating!/ attached to an 
elongated force transfer member 20 which has a lever 
portion 26 (see Figure 2). A second force transfer mem- 
ber 21 is rotatingly attached at one end to pivot 24 and 
rotating^ attached, at substantially the other end, to a 



-MSDOCID: <EP 0772134A1_L 



EP0 772 134 A1 



pivot 23 which is rigidly affixed to member 20. The end 
of force transfer member 20, opposite lever portion 26, 
is rotatingly attached to a pivot member 42 which is rig- 
idly affixed to frame member 43, or the like, as shown 
in Figure 5. It should be noted that pivot points 22 and 
23 also include a slotted opening about pivot pins insert- 
ed therein to provide some sliding movement (in the di- 
rections shown by the arrows in Figure 2) as card 5 is. 
removed from, or inserted into, connector 4 by raising 
or lowering the card. 

It can be seen that the arrangement of Figure 2 pro- 
vides downward vertical motion of card 5, as shown by 
arrow C, such that electrical tabs 9 will seat and connect 
with in-line connector 4 of Figure 1 ; Those skilled in the 
art will understand that if card 5 were directly attached 
to a pivot, then tabs 9 would approach connector 4 at 
an angle and it would be extremely difficult to insert card 
5 into connector 4 and make reliable mechanical and 
electrical connection. 

As shown in Figure 2, when force is exerted up-, 
wardly on lever 26 to disengage an adapter card, there 
is an upward vertical force at pivot 25, which is directly, 
transferred to card 5 at a point in alignment with electri- 
cal connection tabs 9. At the same time, an upward force 
is applied to member 21 through pivot 23 and trans- 
ferred to card 5 at pivot point 24. This provides a slight 
upward force on card 5, which prevents it from rotating 
as the card is removed from connector 4 and allowing 
the card to become easily decoupled from the connec- 
tor, both electrically and mechanically. The process is 
reversed when it is desired to insert a card 5 into a con- 
nector 4 on system board 1 . After guide 31 is attached 
to card 5, it is slid into guide rail 28. Downward force is. 
then applied to lever 26 and this force is transferred to 
card 5 through pivot 25. Since pivot 25 is. aligned with 
tabs 9, this downward force is exerted vertically and dh 
rectly on the connection tabs. The downward force on 
lever 26, also provides a downward force on member 
21 via pivot 23. This force is then transferred as a slight 
downward force to card 5 through pivot 24 to prevent 
the adapter card from rotating as it approaches' connect 
er 4. Thus, as described above, it can be seen how the 
apparatus of Figure 2, allows ah adapter card to be ver- 
tically inserted and removed from an in-line connector 
resident on a computer system board. The previous de- 
scription is one preferred embodiment of the present in-, 
vention, however, those skilled in the art will readily.com- 
prehend how other mechanisms, such as cam gears 
and the like could be used to provide an apparatus that . 
would allow vertical insertion and removal of an adapter 
card from a connector. 

Figure 3 is a view of card 5, taken along line A-A of. 
Figure 2 showing how guide member 31 , along with por- 
tions 27 and 29 are attached to the card using. attach-, 
ment means, such as screws 30, or the like. 

Figure 4 is* a side view of card 5, taken along line 
B-B of Figure 2. This view shows guide member 31 with 
its end portion 29 and attachment means' 30. As noted 



above, the shoulder portion 33 of guide member 31 con- 
forms to the interior surface 35 of guide rail 28 such that 
guide 31 , with card 5 attached thereto, can be longitu- 
dinally inserted into guide rail 28 in a slidable disposi- 
s Won. Pivot means 25 is also shown in Figure 4 and af- 
fixed to guide rail 28 in the same manner as shown in 
Figure 2. 

Figure 5 is a perspective view of a computer system 
having a cover 40 with slots 41 .formed therein. Two slots 
io 41 are shown in Figure 5. However, it is contemplated 
that any number of slots 41 can be formed in cover 40 
in order to accommodate the desired number of adapter 
cards 5. A frame member 43 is shown which isaffixed 
to a system board 1 (or another suitable support) inter- 
*5 nal to the computer. Pivot means 42 are also shown dis- 
posed on frame member 43 and which are rotatingly at- 
tached to pivot point 22 of the guide means of Figure 2. 
Also, pivot point (45 in figure 2) is rotatingly attached to 
frame member 43 or other suitable support to provide 
20 additional mechanical support for the guiding means of 
Figure 2. When cover 40 of Figure 5 is disposed to en- 
compass system board 1 of Figure 1 , the slots 41 will 
be in aligned relation with connectors 4 and 4a. Guide 
rail 28 is slid into frame member 43 and pivot point 22 
2$ is connected to pivot 42, while pivot point 45 is connect- 
ed to pivot 46; In this manner, the card guiding means 
of Figure 2 is also aligned with connectors 4 and 4a of 
system board 1. Guide member 31 is then attached "to 
an adapter card 5 and the entire assembly is slid into 
30 guide rail 28 with lever 26 extending outwardly from slot 
41 . To electrically install the adapter card 5 in the com- 
puter system, downward pressure is placed on lever 26 
until the electrical tabs 9 of the adapter card 5 are in 
electrical connection with, for example, lands lOofcon- 
35 nector 4. To remove a card, or change one adapter card 
for another, the process is reversed. That Is, upward • 
pressure is placed on lever 26 and tabs 9 of card 5 are 
disconnected from lands 10 of. connector 4. The card 5 
with guide rail 31 is then slid out of guide rail 28 and" a 
40 new, or replacement card is slid into guide rail 28. Again, 
downward.pressure is exerted on lever 26 to install the 
new card mechanically and electrically. 

It can readily be seen that the present invention lets 
a user change the computer hardware configuration by 
45 allowing adapter cards 5, such as a fax/modem, graph- 
ics accelerator, or the like, to be installed', or replaced in 
a computer system without the need for removing the' 
computer cover ,40. A computer system user merely 
needs to electrically isolate, or disconnect the connector 
so 4 from the CPU 2 and then install, remove or replace 
the adapter card 5. In personal computers, the electrical 
isolation may include merely powering off the machine, 
while the card is installed or removed. In more sophis- 
ticated systems, it may be necessary to try and isolate 
55 the particular connecter, or a group of connectors where 
a new, or different card is to be installed; without elec- 
trically disconnecting the remaining connectors. 

In most personal computers, workstations and serv- 
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ers the normal procedure for repairing or upgrade action 
in the I/O subsystem is to turn off the power, open the 
covers to gain access to the I/O area (connectors 4) and 
install, remove or replace the adapter card that is bad, 
or being upgraded. The covers are then replaced and 
the power restored. In server systems, it is becoming 
increasingly unacceptable to handle I/O repairing and 
upgrade actions in this manner, since many users are 
tied into the server across complex networks and would 
be shut down during the repair/upgrade action. * 

Some mainframe and high-end server machines to- 
day offer an expensive on-line maintenance capability 
by providing redundant systems. The present invention 
provides a relatively inexpensive and simple way to per- 
form on-line maintenance of I/O subsystems which al- 
low I/O cards to be replaced without opening the covers 
of the computer, and while allowing the system and oth- * 
er parts of the I/O subsystem to continue processing op- 
erations. 

Figure 6 shows the electrical connections for the 
various. components in an I/O subsystem of the compu- 
ter system. CPU 2 and memory 3 are- shown connected 
to system bus 1 00 such as the 60X or 6XX bus available " 
from IBM Corporation. A host bridge chip 11 3 is shown 
and provides an interface between system bus 1 00 and 
a mezzanine bus 102 used for input/output (I/O); such 
as the PCI. bus. Although Figure 6 shows a PCI bus'and 
PCI host bridge chip, the present invention contem- 
plates the use of any I/O bus. PCI bridge chip 113 con- 
tainsJogic and functionality that enables the bus proto- 
cols to be translated between system bus 100 and bus 
102, including interrupt handling, message passing, ar- 
bitration, snooping and the like. « 

Mezzanine bus 102 is connected to at least one PCI 
to PCI bridge chip 104. This chip provides the interface 
between the I/O bus and the actual adapter slot 106 
which includes a connector 4 and additional logic. Slot 
106 will receive an I/O device 108, which may be resi- 
dent on device 5. The PCI. architecture and specifica- 
tions are available from the PCI Special Interest Group 
(PCNSIG). The present invention adds additional con- 
trol logic as shown by reference numeral 105, but does 
not require modification of the PCI architecture. It should 
be noted that most computer systems will include more 
than one I/O slot,. as shown in Figure 6. The additional 
slots are represented by adding the letter "a - to the ref- 
erence numerals which are used to describe the com- ■ 
ponents of the present invention. • -.- ■ , .■ . «• 
As noted previously, in order for a system user to 
install, remove or replace an adapter card, the connec- 
tor, or slot (including a . bank of slots), must be isolated 
such that all of the processing activity at that slot, or 
bank of slots is ceased. One way to stop all activity is to 
merely turn the machine power off. However, this is of- * 
ten not practical for server type machines which inter- 
connect many client computers. This is particularly true 
in a fault tolerant,. or high availability system; Also, with 
the ava liability of multitasking systems, it may not be de- 



sirable to turn off the power of a single computer, when 
a particularty important activity is taking place. For ex- 
ample, a personal computer equipped with a fax/modem 
may need to remain powered on in order to receive a 
5 transmission. In this case, it would be advantageous to 
be able to deactivate a particular I/O slot(s), with the re- 
maining slots being in an active state. The present in- 
vention allows a user to replace a particular adapter card 
without the need of powering off a system, whether it is 
*o a server or personal computer. 

Figure 7 shows the components of the present in- 
vention, which allow adapter cards to be installed, re- 
moved or replaced, without the need to power off' the 
entire system. The I/O bus 102, e.g. a PCI bus, is con- 
's nected to a PCI to PCI bridge chip 104, and in combi- 
nation with the additional control logic 105, is used to 
control a single PCI slot 1 06. It should be noted that slot 
106 is considered the entire electrical and mechanical 
functional interface between secondary bus 103 and 
20 chip 1 04. This interface includes connector 4 as one por- 
tion, along with various other electrical and mechanical 
components; such as an electromechanical sensing de- 
vice 107, as discussed below. One modified bridge chip 
104 in conjunction with one set of control logic 105 is 
25 • used tocontrol one slot 106. Of course, this combination 
of elements will be replicated according to the number 
of I/O slots present in the computer system. In this man-, 
ner each slot can be selectively reset with a RST# sig- 
nal, and power removed from the slot when an I/O card 
30 is to be removed, replaced or installed. 

The planar, or system board 1 , will be modified to 
include the bridge chip 104 for each I/O connector 4. 
The bridge chip 104 is then used to isolate the second- 
ary bus 1 03 and slot 1 06 from the remainder of I/O bus 
35 102. When the slot.is empty/there is no power applied 
to the slot, such that a new card can be installed therein. 
If an I/O card is to be removed, it is first reset to assure 
that the adapter is not active during removal. The bridge 
chip 104 will take the slot 106 off-line, and with the aid 
• 40 of additional control logic, remove power from that card 
at the time it is reset. The card is then mechanically re- " 
moved, as previously described. Also, electromechani- 
cal means, such as a solenoid switch* or the like, can 
be provided to interlock the I/O card to prevent the card 
*s from being removed while ppwer W applied to the slot. 

For I/O card insertion, the card is inserted into the 
machine (the guide means, or the like previously de- 
scribed maytte used). Once the card is' in place, the 
system is configured to identify and initialize the new I/ 
50 O adapter card. Until the newly installed card is config- 
ured, the card slot 1 06 is electrically Isolated from the I/ 
O bus 102. When the card is configured, the logic on the 
planar provides for a ramp-up of power to the power pins 
on the connector 4. During the upgrade/repair action, 
55 only the I/O slot being reconfigured would be affected, 
allowing the system and other portions of the I/O sub- 
* system to remain in operation. It should be noted that • 
- above described operation does not require a, change 



'SDCCID: <EP 0772134A1_I_> 



5 



EP0 772 134 A1 



10 



. to the PCI (or other I/O bus) specification or architecture 
itself. That is, the present invention can be totally imple- 
mented without any modification to the I/O bus architec- 
ture. 

Control logic 105, as shown in Figure 7, includes 
slot reset detector 110, bridge control logic 112, power 
control logic 114 and LED driver 116. Also, a light emit- 
ting diode (LED) 11B is shown which is controlled by 
LED driver 116. 

The preferred embodiment of Figure 7 has been de- 
scribed as isolating an individual slot, however, the 
scope of the present invention includes isolating any 
number of slots greater than one, i.e. a bank of slots 
from other slots or banks of slots. By isolating a bank of 
slots, a single bridge chip 1 04 can be used to control the 
bank, thus eliminating the need to provide one bridge' 
chip 1 04 for each slot 1 06. Of course, some flexibility is 
lost when a single chip controls more than one slot, how^ 
ever, this may be desirable in some applications sys- 
tems where it is desired to reduce system costs and still 
be able to change cards without deactivating other sys- 
tem functions. 

Figure 8 is a flow chart that will be used in conjunc- 
tion with Figure 7 to explain the electrical operation of 
the present invention. 

In a first case, it will be assumed that there is an 
existing adapter card 5 in a PCI (or other I/O protocol) 
slot .106 which is to be removed. Referring to Figure 8, 
at step 1 the user initiates (by a sequence of keystrokes, 
selecting an icon with a mouse, or the like) the process 
for changing the system hardware configuration by re- 
moving, replacing or adding an adapter card. The proc- 
ess then determines whether a single adapter slot, or a 
bank of slots, controlled by a single bridge chip T04, r is * 
present in the system. If a bank of slots are present, then 
the method proceeds to step 17 of Figure 1 3 (discussed- 
below). If, it is determined at step la, that a single slot is 
present, then step 2 determines whether there is a card 
present in slot 106. Electromechanical sensing device 
107 provides the card presence signal to logic 114. In 
this example, the process will determine that a card ex- 
ists in slot 106, since it is being assumed that a J card is 
being removed. The user will initiate this process by in- 
putting commands, or the like to the computer system, 
via a keyboard; mouse, stylus, or other I/O device. 
These commands may require the user to provide cer- 
tain information,- such as which one of a plurality of slots 
106 is to be re-configured, or the like. - 

At step; 1 S 0, Xhe operating system, such as the Disk 
Operating System (DOS), OS/2, AIX, or the like" (OS/2 
and AIX are trademarks of IBM Corp.) causes'alfdata 
processing activity between the adapter 5 and the re- 
mainder of, the computer system to be ceased. Subse- 
quently, a reset RST# signal is issued from bridge chip 
1 04 to the I/O slot 1 06 (step 1 1 ). The RST# signal is also 
sent to reset detector ,1 1 0,-which in turn transmits a con- 
trol signal to bridge control logic 112. At step 12, the 1/ 
O bridge chip 104 decouples the secondary bus 103 



from the primary .I/O bus -1 02. This decoupling is accom- 
plished by a control signal which is sent from bridge con- 
trol logic .112 to I/O bridge chip 104. Based on the de- 
tection of the RST# signal slot reset detector 110 also 
5 sends a control signal to power control logic 114, indi- 
cating that the power to slot 106 should be gradually re- 
duced (ramped down). The. power is then decreased at 
step 13. 

Figure 11 shows one embodiment of a circuit which 
10 could be used by power control logic 114 to ramp the 
power to slot 106 up and/or down. The voltage Vdd is 
shown on rail 121 and connected to N-type transistors 
120, 1 22, 1 23 and 124 (N-type transistors conduct elec- 
tricity when a, voltage, i.e. ; logical 1 is applied to their 
75 9ate); Each of.these devices will have adifferent thresh- 
old voltage and present a different resistance when 
turned on, such that the. voltage drop across each of the 
transistors 'will be different. In the embodiment of Figure 
11, the devices will be sized where transistor 120 will 
20 have a large voltage drop and each of transistors 122, 
123 and 124 _will have a successively smaller voltage' 
d r °P- for example, if Vdd is assumed to be 3.3 volts and 
transistor .12:0 has a voltage drop of 2:5 volts, then at 
t=1 the voltage on rail 125 will be Vdd- 2.5 = 0.8 volts. 
2S |f transistor 1 22 is sized to give a threshold voltage drop 
of 1.5 volts, then at t=2, the voltage on rail 125 will be 
3.3 - 1 .5 = 1 .8 volts. Assuming for this example that tran- 
sistor 123 has a threshold voltage of 0.5 volts, then at 
. t=3, the voltage ; on rail 1 25 is 3.3 - 0.5 = 2.8 volts. And, 
30 it will be assumed, that transistor. 1.24 has a threshold 
voltage of substantially 0.0, such that at t=4, the voltage 
on raiM 25 is 3.3 - 0 = 3.3 volts, or Vdd. Thus, it can be 
seen how from time t=1 to t=4, the voltage on rail 125, 
which is connected to slot 106 is gradually increased 
3S (ramped up) from 0.8 volts to 3.3 volts. When, it is de- 
sired to gradually decrease the power to slot 106 (ramp 
down), the process is essentially reversed. In the steady 
state condition, transistor. 1 24 is turned on such that Vdd 
is provided to slot 106. To decrease the voltage on rail 
40 , 1 25,transistor.124[ is turned off by removing the voltage 
from its gate, and transistor 123 is turned on. Thus, 2.8 
volts is then on. rail 1 25, due to the threshold voltage of 
0.5 volts from device 1 23. During the next time period, 
transistor 1 23 is turned off and device 1 22 is turned on, 
45 and a voltage of 1 .8 volts will be on rail 1 25 because of ' 
the 1.5 yojuhresholdof device 122. Next; transistor 122 
is turned ^ off, .and transistor 120- is turned on placing a 
voltage of 0,8 volts on rail 1 25 due to the 2.5 volt thresh- 
old, of transistor 120 (step 13). -Of course, those skilled 
50 in the art will easily understand how the pulses at t=1 to 
t=4 can be varied by a clock generation circuit, and that 
additional transistors can be added to provide a more 
gradually .sloping. transition at slot 106 from no power 
(voltage =0) to fully powered (voltage = Vdd). Typical 
55 devices in slot 106 may require the voltage to be pow- 
ered down to 0.2 volts. Those skilled in the art will un- 
derstand how a wide range of voltage levels can be 
achieved with the circuit of Figure 11. 
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Further, power control logic 114 receives a confir- 
mation signal from bridge control logic 112 that indicates 
slot 106 has actually been decoupled from the I/O bus 
102. This will prevent any damage, data loss, or the like 
that could occur due to removing an actively coupled I/ 
O card. A card presence signal is also provided from 
slot 106 to power control logic 114 which confirms that 
there actually is a card 5 in slot 106. Once the power 
has been removed from sbt 106, a signal is sent from 
power control logic. 114 to LED driver 116, which in turn 
energizes LED 1 18 (step 1 4), thereby indicating to a us- 
er that the slot has been decoupled from the bus the 
slot has been de-energized and the card can now be 
removed (step 1 5) in accordance with the previously de- 
scribed mechanical guide -means, or the like (Figures 
1-5). in one preferred embodiment, electromechanical 
device 107 : such as a relay, solenoid switch, or the like 
can be used to physically prevent the card from being 
removed unless it has been powered down. Subsequent 
to step 15 the process of removing an adapter card from 
an I/O slot ends at step 1 6. It should be noted that those 
skilled in the art will understand there are many different 
implementations of control logic 105, and the present 
invention is not limited by any one particular implemen- 
tation. For example, any portion of the external control 
logic 105 could be incorporated into the PCI to PCI 
bridge chip 104, although additional pins on bridge chip- 
104 would be required. 

In the second example, it will be assumed that a 
card is being inserted into a slot on a computer system 
In this case, the card to be inserted is either new or is 
replacing another adapter card which has been re- 
moved in accordance with steps 10-15. Therefore at 
step 2 it is determined that there is hot an adapter card 
5 rn connector 4 of slot 106. Step 3 then ensures that 
the power has been removed from slot 106, as indicated 
by LED 118, and.the fact that a new card 5 cannot be 
physically inserted into a slot, due to the electromechan- 
ical device 107, if there is power applied to the slot At 
step 4, the-new adapter card 5 is-inserted intoconnector 
4 of slot 106, using the mechanical apparatus of the 
present invention as described in-conjunctioh with Fig- 
ures 1-5. Electromechanical device 107 will then issue 
the card presence signaJ to power control logic 114 
thereby indicating that new card 5 is physically present 
in slot 106 (step 5). Receipt of the card presence signal 
by logic 114 indicates that electrical power can now be ' 
gradually applied to slot 106 through the slot power and 
slot ground power distribution lines using apparatus 
such as previously described in accordance with Figure 
1 1 (step 6). Once slot 1 06 is powered up, the power con- 
trol logic 1 1 4 then provides a control signal to LED driver * 
116 which causes the LED to be turned off indicating to 
the user that power is now applied to the slot and the 
card cannot be removed. At step 7, the power control 
logic issues a connect bus control signal to bridge con- 
trol circuit 112, which in turn sends an enable signal to 
the I/O bridge chip 1 04, thus causing secondary bus 103 



to be coupled with the primary I/O bus 102. The FtST# 
signal from bridge chip 104 is then deactivated at step 
8. At this time the new card 5 is physically present in 
connector 4, with the power applied to slot 1 06, and the 
5 secondary bus 103 connected to I/O bus 102. All that 
remains is for the software in the computer system to 
begin configuration activity, such as determining what 
type of card has been installed and type of protocol it 
uses (step 9). The configuration software may read a 
" read only memory (ROM) on the adapter card to make 
these determinations. Subsequent to configuration, da- 
ta processing activity using the new card can begin The 
installation process is then complete and the method of 
Figure 8 ends at step 10. 

Figure 1 2 shows a block diagram of an embodiment 
of the present invention wherein a bank of slots 1 06 are 
controlled by a single bridge chip 104. These slots can 
then be controlled, i.e. deactivated, as a group. Refer- 
ence numerals in Figure 1 2 corresponding to the same 
20 numerals used in Figure 7 are intended to represent 
identical components and will not be discussed again 
It can be seen that reset detector 110 provides a control 
signal, based on reset signal RST# to an arbiter 130 
This arbiter is- a standard logic device which receives 
25 requests for ownership of the secondary bus 1 03 and 
then awarcJs the bus to the bridge chip 104, or one of 
the slots.106, based on a set of predetermined criteria 
e.g. the device which least recently had access to the 
bus. Arbiter 130 is shown as being connected to bridge 
30 chip 104,. but js also connected to each slot 106 through 
the bridge chip. Request lines 131 are shown which 
transmits a. bus request signal from slots 106 to arbiter 
1 30, via bridge chip 1 04. Those skilled in the art will un- 
derstand that bus 1 03 contains many other control sig- ■ 
55 nal lines, such as an arbitration grant line, and the like 
which indicates to a particular slot that the bus has been 
awarded to a particular slot subsequent to an arbitration 
cycle. Other lines accommodating data and address sig- 
nals are also included in bus 103, but not shown in Fig- 
ure 1 2. A set of .in line switches 1 33 are placed in request 
lines 131 and controlled by switch control logic 117 It 
should be noted that there will be one set of switches 
for each slot present in the bank. Upon detection of the- 
RST# signal from bridge chip 104, switch control logic 
4 * 117, sends a control signal io arbiter 130 which then 
. awards ownership of bus 103 to bridge chip 104 This 
ensures that none of the slots 1 06 in the bank have own- 
ership of the bus 103 when the process of deactivating - 
the bank of slots is initiated. Concurrently, with the signal 
so sent to arbiter 1 30, switch control.logic 117, also sends - 
a control signal to switches 1 33, which opens the switch- • 
es, thus, preventing any of the cards in the slots 106 
. from requesting access to bus 103 and initiating an ar- 
bitration cycle. Once, arbitration is disabled, then the 
« f bank of slots 106 can be deactivated using the same 
techniques, described above with regard to Fiqures 7 
and 8. , 

The flowchart of Figure 1 3 will now be described in 
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conjunction with Figure 12. If at step la of Figure'8 de- 
termines that a bank of slots is present in the computer 
system, then step 17 of Figure 1 3 determines if the card, 
or cards, to be removed or replaced is in or for one of 
the slots in the bank. If so, then at step 18 slot reset s 
detector 110 provides a control signal to switch control 
logic 1 1 7, which in turn provides a signal to arbiter 1 30. 
At step 19, arbiter 130 awards ownership of bus 103 to 
bridge chip 104. Switch control logic 117 then disables 
bus request signal lines 131 by opening switches 133 io 
(step 20). At step 21 the configuration software stops 
activity to the feature cards in slots 106. Bridge chip 1 04 
then decouples secondary bus 103 from I/O bus 102 at 
step 22. The power to the bank of slots is then gradually 
decreased at step 23. At step 24, LED 118 indicates is 
when the power is removed from slots 1 06, and the card, 
or cards can then be removed (step 25). 

If at step 17, it is determined that a card is to be 
inserted into one of the slots 106 in the bank, then the 
slots in the bank will be inactive (step 26) since the bank 20 
has previously been deactivated in accordance with 
steps 1 8-25. At step 27 the card(s) 5 to be added to the 
computer system are inserted into connector(s) 4. Elec- 
tromagnetic switch(es) 1 07 then indicates the presence 
of the card(s) (step 28). The power to the bank of slots 25 
is then gradually increased at step 29 and an indication 
is given that bus 103 can be coupled to I/O bus 102 by 
bridge chip 104.,At step 30 : bridge chip 104, then recon- 
nects secondary bus 103 to I/O bus 102. The reset sig- 
nal is deactivated at step 31 . This causes the' switch con- 30 
trol logic to enable arbitration for bus 103 by closing 
switches 133 (step 32). The configuration software for 
each card in the bank of slots allows the card(s) 5 in the 
bank of slots to begin data processing activities (step 
33). Subsequent to both steps 25 and 33, the process 35 
continues to step 16 (Figure 8) and ends. ' . '" 

In computer systems having a particular type of I/O 
bus, such as the PCI bus, it is impossible (in some cas- 
es) to report errors which occur on the I/O bus and allow 
for recovery from those errors. For example; address *o 
parity errors are reported with a system error signal 
(SERR#). This signal will generate a non-maskable in- 
terrupt (NMI) signal to the central processing unit. A 
problem arises because in many systems; an NMI" is 
non-recoverable and any error reported with an NMI will 45 
cause the computer system to be restarted. That is,^ 
there is no error, recovery code for NMIs' and the com- 
puter system must go through its ihitiarprbgram ioad 
(I PL) in order to resolve the error condition'. This is un- 
desirable in computer systems, such as servers, where so 
re-IPL of- the system will cause all of the client systems 
connected to the server to also be restarted. In this case, 
even those client systems which are error free will have 
to be re-IPLed, since the server machine will respond to 
the NMI with a machine check. ' 55 * 

Additionally, client systems, such as personal com- 
puters which have multiple feature cards in various slots 
will be adversely affected if one of the feature cards, or 



devices issues a NMI. That is, if a single card issues a 
NMI to the CPU, the only recourse is for the CPU to re- 
IPL. This is because the CPU is unable to identify which 
feature card has the error condition which caused the 
NMI to be issued. 

Further, the SERR# signal is sometimes driven by 
devices (i.e. cards) to indicate that an unserviceable in- 
ternal error condition exists. Typically, the SERR# signal 
for various devices is ORed together with other SERR# 
signals, such that the CPU does not know which device . 
has initiated the signal, why it has been issued, or if there 
is more than one device issuing a SERR# signal. An- 
other example of an unrecoverable error is.substantially 
all errors which occur when the operation being per- 
formed has been posted by a slave device (adapter 
card) for future completion by a master (CPU), and the 
master does not complete the operation. This type of 
error applies to all programmed I/O (PIO) operations (via 
load and store instructions), used in conjunction with 
many different types of commercially available micro- 
processors, which are destined for the PCI memory ad- 
dress space. Thus, the system software can write data 
to an I/O, device, e.g. a PCI device, and since the oper- 
ation completes successfully on the processor bus, the 
software program continues operations. Any error that 
subsequently occurs on the PCI bus will then be too late 
for the software to correct the problem. 

In another embodiment of the present invention the 
I/O protocol can be altered to a minor extent in order to 
allow recovery of errors on a PCI (or other similar I/O 
bus) bus. In. order for this error recovery to be possible, 
each slot must be isolated such that the CPU can de- 
termine the type of error and which card is issuing the 
error signal. . 

Figure 9 is a block diagram of a preferred embodi- 
ment of the error recovery aspect of the present inven- 
tion. It should be noted that components referred to by 
numerals in Figure 9 correspond to the same compo- 
nents used in Figures 7 and 12 and will not be discussed 
again. In Figure 9, system bus 1 00 connects CPU 2 and 
memory 3 to bridge chip 113. CPU 2 has a software op- 
erating system 200, s.uch as the AIX or OS/2 operating 
system. Also device drivers 201 are installed on CPU 2, 
and may be included in operating system 200. These 
device drivers 201 are used to control the various com- 
ponents,, including the feature cards 5 in slots 106, of 
the computer system. Device drivers 201 performs such 
functions and communications, error detection and cor- 
rection, and the like. I/O host bridge chip 11 3 is connect- 
ed to system bus 100 and also to I/O bus 102. Bridge 
chip 104 is then connected to I/O bus 102 and slot 106. 
In the currently described embodiment, at least one ad- 
ditional register 203 is added to bridge chip 1 04 for stor- 
ing status information. Further, it can be seen from Fig- 
ure 9 that signal line 103 is used to transmit the reset 
signal RST# to slot 106. And, signal line 204 will provide 
the SERR# signal from slot 106 to bridge chip 104. The 
remainder of the components in Figure 9 are identical 
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to those shown in Figure 7. anddescribed in accordance 
therewith: 

The key to the error recovery scheme is to isolate 
each I/O device, i.e. each I/O slot 106. from the primary 
/O bus 1 02 with a modified bridge chip 1 04. In this pre- s 
ferred embodiment a modification to the previously ref- 
erenced PCI to PCI implementation is necessary. More 
speaf.cally, a recovery mode configuration bit is added 
that will be set when any error condition is present on a 
particular one of the cards in a slot 106. When the re- w 
coyery mode configuration bit is set, the RST# signal 
w.ll be activated and held, in order to keep the device 5 
«n .ts reset state to prevent any damage from being 
caused to the system, as described above Further a 
status bit in register 203 is set to signal an external 'in- is 
terrupt to the system. Also, when the configuration bit is ■ 
set any further, loads or stores from the CPU 2 to the 
device are ignored by throwing away any data from the 
CPU on a store, and returning a value of all logical ones 
on any load operation. Finally, any direct memory ac- so 
cess (DMA) data from the device 5 is discarded and any 
operation that would pass on the DMA data are aborted 
The device driver 201 has responsibility for check- 
ing the status of any I/O operations at either the bridge • 
chip 1 04, or the device itself to make sure that the op- 2s 
erafon is completed correctly at specific points in the 
code (instructions being executed). Register' 203 will 
contain some bit where, e.g. a logical 0 will indicate that 
there ,s no error present and the device driver can read 
he information from the I/O device. However, if the sta- so 
tus bit m register 203 contains a logical 1 and the bridge 
ch.p 104 is holding device 5 in the reset state (RST# 
active), then when the device driver reads the informa- 
tion from the device all the bits will be setto logical ones 
thus, indicating to the driver that the operation did not" ss 
complete properly. It should be noted that errors on the 
primary I/O bus 102 will still generate a machine check 
causing a re T IPL of the system. However, by using the 
isolation methodology of the present invention the pri; 
mary I/O bus 102 does not have any slots 106 directly 40 
connected to it, thus dramatically increasing its reliabil- 

Further, the computer system can be designed'so - 
that only specific ones of the devices 5 will participate 
in this reset' type of error recovery. When the error re- # 
covery of this embodiment is not turned on, then errors 
are passed on from the devices to- the primary l/o bus ' 
102, with the result that a machine check will probably^ 
be generated. It may be acceptable for certain systems 
to be , designed wherein only the critical devices (e.g so 
DASD and LAN adapters in server systems) which haS 

dlethemajorityofdatainthesystemneedtobemodified ■ ' 
to take advantage of the error recovery scheme of the ' 
present invention. In this manner, the reliability of the 
system can be greatly increased without the need for ss 
modifying the entire computer system. 

Figure 10 is a flow chart showing the steps imple- 
mented by the error recovery aspect of the present in- 



vention. At step 1 the process is started and at step 2 
the device driver performs any load/store operations to 
the device being controlled. It should be noted that the 
present invention also addresses the situation wherein 
a string, or related group, of load/store operations are 
<^ooT P ' emented Ste P 3 ,ne " determines whether an 
s, 9 nal is Present from one of the plurality of de- 
vices on the adapter cards in the computer system If 
so then at step 4, the reset signal RST# is activated (by 
bndge chip 104) to the device signalling SERR# to 
place the device 5 in its reset state and avoid any dam- 
age to the system, while still keeping the device coupled 
to the system. That is, the slot 106 having the feature 
card wh,ch issued the SERR# signal is reset in the man- 
ner as previously described (data processing activity is 
ceased). At step 5, the status' bit in register 203 is set 
ejg. to logical 1 Next, at step 6, the control hardware as 
shown in Figure 9 will ignore all load and store opera- 

!^ S ;, and ab ° n any pendin 9 direct me "iory access 
(DMA) operations. If at step 3 it was determined that 
there was no SERR# .present, then the process of the 
present invention continues to step 7 where it is deter- 
mined if there are additional load and store operations 
m the string of instructions being implemented If there 
are additional load and/or stores, then the process loops 
bacjc to step 2 where the device driver implements the 
load/store. If there are no additional load/store opera- 
tions, then at step 8 the device driver reads the status 
bit ,n register 203 of bridge chip 104. Step 9 then deter- 
mines .fan error condition has occurred. If at step 5 the 
status bit was not set to indicate that an SERR# error 
has occurred, then the load/store operations in consid- 
.ered.to have completed (step 10). However, if at step 5 
J^™^ was set to indica 'e the presence of an 
bERR* signal, then bridge chip 104 is reconfigured (by 
reinitialization) at step 11. Typically, the device driver 
will reset the. feature card by reinitializing the device 
However, .the present invention contemplates that the 
device driver may also attempt a retry operation that 
would tell the bus master device which is attempting to 
transfer information between itself and the device to at- 
tempt the transfer operation again. If the error condition 
has been removed, then the -load/store operation may 
be implemented correctly. Further, at.step 1 1 , the device ■ 
driver may call one or more service routines which will 
attempt to correct the error condition in the device - 
These error routines may reside in computer's read onfy 
memory (ROM) as part of the power on self test (post) " 
code, or the like. However, the typical situation is for the * 
device driver to re-initialize the device having the error- 
condition. In accordance with the present invention only 
the particular devi.ee which actually generates the error 
code with be re.-IPLed. The remaining devices on the 
plurality of feature cards in the computer system will 
continue normal data processing activities. Thus it can 
be seen how the present invention allows a computer 
system to isolate a single device in a particular I/O slot 
106, without affecting the operations .of other devices on 
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other cards 5 in different slots. 

At step 1 2, the particular device generating the error 
code is then re-initialized by the device driver. The de- 
vice driver is then set back to a checkpoint state for nor- 
mal data processing activities (step 1 3). That is, the de- 
vice driver has initialized the device and is controlling its 
activities in a normal manner, e.g. by implementing load 
and store operations to transfer information between it- 
self and the device being controlled. This also includes 
determining when an SERR# signal has occurred in the 
device being controlled, as shown by step 3. It can be 
seen that subsequent to step 13 the process loops 
backs to step 2 and continues. \ 

It can be seen how the present invention will greatly 
improve reliability by allowing error conditions to be cor- 
rected on individual feature cards, without the need to 
power down the entire computer system... 



Claims 

1. A computer system, comprising: 
a CPU (2); 

at least one I/O slot (106), electrically connect- 
ed to said CPU, for receiving a feature card (5); 
and 



3. 



means (105) for changing a hardware configu- 
ration of said computer system by deactivating 
said at least one I/O slot while said CPU con-, 
currently performs data processing operations'. 

A system as claimed in claim i further comprising - 
means (109) for determining whether said at least : 
one I/O slot (106) is empty. 

A system as claimed in claim 2 further comprising 
a bridge chip (104) for electrically connecting said 
I/O slot (106) to a bus (102). , 

A system as claimed in claim 3 wherein said means 
(105) for changing comprises means-for ceasing 
data processing activities by said feature card (5) • 
in said at least one I/O slot (106). 

A system as claimed in claim 4 wherein, said means 
(105) for changing further comprises: 

means (112) for activating a reset control sig- 
nal; 
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A system as claimed in claim 5 wherein said means 
for changing further comprises means for causing, 
in response to said means for resetting, said bridge 
chip (104) to decouple said at least one I/O slot 
(106) from said bus (102), and for reducing electri- 
cal power to said at least one I/O slot. 

A system as claimed in claim 6 wherein said means 
for changing further comprises means (1 1 6, 1 1 8) for 
indicating when said at least one I/O slot (106) is 
deactivated and said feature card(s) can be re- 
moved. 

A system as claimed in claim 7 wherein said means 
for changing further comprises: 

means (107) for detecting when a new feature 
card to be installed in said at least one I/O slot 
(106) is inserted into a connector (4, 4a); and 

means (12V, 122, 123, 124) for increasing, in 
response to detection of said new feature card, 
electrical power to said connector. 

2S 9. A system as claimed in claim 8 wherein said means 
for changing further comprises means for causing, 
in response to. detection of said new feature card 
(5) : said bridge chip (104) to couple* said at least 
one I/O slot (106) to said bus (102), and for deacti- 
vating said reset control signal. 

10. A system as claimed in claim 9 wherein said means 
for changing further comprises means for initiating 
data processing activities for said new feature card 
(5) at said,at least one I/O slot (106). 

11. A method of changing a hardware configuration in 
a computer system having a CPU, comprising the 
steps of: 

providing at least one I/O slot, electrically con- 
nected to said CPU, for receiving a feature 
card; and 

deactivating said at least one I/O slot while said 
CPU concurrently performs data processing 
operations. 



so 



30 



35 



40 



45 



means (110) for detecting said reset control sig- 
nal; and' . 



55 



means for resetting said feature card currently 
in said at least one I/O slot. 
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